
[14] M. Lewis, Y. Liu, N. Goyal, et al., “BART: Denoising sequence-to-sequence
pre-training for natural language generation, translation, and comprehen-
sion”, in Proceedings of the 58th Annual Meeting of the Association for
Computational Linguistics, Online: Association for Computational Linguis-
tics, Jul. 2020, pp. 7871–7880. DOI: 10 . 18653 / v1 / 2020 . acl -
main . 703. [Online]. Available: https : / / aclanthology. org /
2020.acl-main.703.
[15] J. Devlin, M. Chang, K. Lee, and K. Toutanova, “BERT: pre-training of
deep bidirectional transformers for language understanding”.
[16] S. Clinchant, K. W. Jung, and V. Nikoulina, “On the use of BERT for neu-
ral machine translation”, in Proceedings of the 3rd Workshop on Neural
Generation and Translation, Hong Kong: Association for Computational
Linguistics, Nov. 2019, pp. 108–117. DOI: 10.18653/v1/D19-5611.
[Online]. Available: https://aclanthology.org/D19-5611.
[17] A. Graves and A. Graves, “Long short-term memory”, Supervised sequence
labelling with recurrent neural networks, pp. 37–45, 2012.
[18] T. Luong, H. Pham, and C. D. Manning, “Effective approaches to attention-
based neural machine translation”, in Proceeding of the 2015 Conference
on Empirical Methods in Natural Language Processing (EMNLP), The As-
sociation for Computational Linguistics, 2015, pp. 1412–1421. DOI: 10.
18653/v1/d15-1166.
[19] D. Bahdanau, K. Cho, and Y. Bengio, “Neural machine translation by jointly
learning to align and translate”, in 3rd International Conference on Learn-
ing Representations, ICLR 2015, San Diego, CA, USA, May 7-9, 2015, Con-
ference Track Proceedings, Y. Bengio and Y. LeCun, Eds., 2015. [Online].
Available: http://arxiv.org/abs/1409.0473.
[20] A. Vaswani, N. Shazeer, N. Parmar, et al., “Attention is all you need”, in
Proceeding of the Advances in Neural Information Processing Systems 30:
Annual Conference on Neural Information Processing System (NeurIPS),
2017, pp. 5998–6008.
[21] J. Zhu, Y. Xia, L. Wu, et al., “Incorporating BERT into neural machine
translation”, in 8th International Conference on Learning Representations,
ICLR 2020, Addis Ababa, Ethiopia, April 26-30, 2020, OpenReview.net,
2020. [Online]. Available: https://openreview.net/forum?id=
Hyl7ygStwB.
48